A Multi-stage Method for Text-To-Pronunciation Conversion
Abstract
Text-to-Pronunciation conversion is widely used in speech synthesis and speech recognition systems. In this paper we present a data-driven, language-independent, multi-stage model for Text-to-Pronunciation conversion. Trained on a dictionary of well-aligned grapheme/phoneme pairs, and using a re-scoring strategy for graphemes that are likely to be tagged erroneously, our model not only increases efficiency but also achieves higher accuracy than other data-driven approaches that have been applied to the same tasks.
Similar resources
Multi-Stage DC-AC Converter Based on new DC-DC converter for energy conversion
This paper proposes a multi-stage power generation system suitable for renewable energy sources, which is composed of a DC-DC power converter and a three-phase inverter. The DC-DC power converter is a boost converter that converts the output voltage of the DC source into two voltage sources. The DC-DC converter has two switches and operates as in continuous conduction mode. The input current of DC-DC...
Grapheme-to-Phoneme Conversion for Amharic Text-to-Speech System
Developing a correct Grapheme-to-Phoneme (GTP) conversion method is a central problem in text-to-speech synthesis. In particular, deriving phonological features that are not shown in the orthography is challenging. In the Amharic language, geminates and epenthetic vowels are crucial for proper pronunciation, but neither is shown in the orthography. This paper describes an architecture, a preprocessing...
Grapheme-to-phoneme conversion for Chinese text-to-speech
This paper reports a study of grapheme-to-phoneme (G2P) conversion for a Chinese text-to-speech (TTS) system. As Chinese is a syllabic language, the syllable is commonly adopted as the phonetic unit in TTS and is represented by pinyin, the standard Chinese romanization. Chinese G2P conversion aims to find the correct pinyin for polyphonic graphemes in the input text. In this paper, a complete G2P fram...
On the Pronunciation of Common Lexica and Proper Names in European Portuguese
This paper presents some relevant aspects of the pronunciation of proper names and common lexica in European Portuguese. It starts with a brief description of statistical data concerning the occurrence and distribution of graphemes and phonemes for the two corpora and the distinction between different subclasses found in proper names, namely first and last names, toponyms and acronyms. The central t...
A Multi-Strategy Approach to Improving Pronunciation by Analogy
Pronunciation by analogy (PbA) is a data-driven method for relating letters to sound, with potential application to next-generation text-to-speech systems. This paper extends previous work on PbA in several directions. First, we have included "full" pattern matching between input letter string and dictionary entries, as well as including lexical stress in letter-to-phoneme conversion. Second, w...